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Abstract 

The majority of compute time doing lattice QCD is spent inverting the fermion matrix. The 
time that this takes increases with the condition number of the matrix. The FLIC (Fat Link Irrele- 
vant Clover) action displays, among other properties, an improved condition number compared to 
standard actions and hence is of interest due to potential compute time savings. However, due to 
its two different link sets there is a factor of two cost in floating point multiplications compared to 
the Wilson action. An additional factor of two has been attributed due to the loss of the so-called 
spin projection trick. We show that any split-link action may be written in terms of spin projec- 
tors, reducing the additional cost to at most a factor of two. Also, we review an efficient means of 
evaluating the clover term, which is additional expense not present in the Wilson action. 
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PACS numbers: 



INTRODUCTION 



The FLIC (Fat Link Irrelevant Clover) 
action has become of interest recently as 
an alternative to standard actions (such as 
Wilson or Clover) due to its superior con- 
dition number 0. This allows for more ef- 
ficient fermion matrix inversion which is 
used in the calculation of propagators and dy- 
namical configurations, and in evaluating the 
matrix sign function in the overlap fermion 
formalism^, Hence, actions with an im- 
proved condition number have the potential 
to save significant compute time. 

To begin, we review the spin-projection 
trick [^] for the Wilson action, which utilises 
projection operators in spinor space to re- 
duce the computation required to evaluate 
the Wilson action. We then generalise this 
trick to the broader class of split-link actions. 
Finally, we examine the FLIC action specifi- 
cally, and discuss a similar trick for reducing 
the cost of evaluating the clover term. 
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II. STANDARD 
TRICK 



SPIN-PROJECTION 



The Wilson operator 
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I^w = y+^A + m (1) 
can be written as 
{D^ij)^ = (4+m)V^,-^ J](l-7^)?7^(x)V^, 



+ (l + ^,)Ulix))ij,., 
= (4 + m)^^. - U^{x)T-tlj^+f, 

+ f/t(x)r+^._^, (2) 

where we have defined the spin projectors 



rj = ^(i±7.). 



(3) 



If we now examine, for example, F^ we see 
that 



^ 2 
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(4) 



Typeset by REVT^ 
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Similar expressions for /j, = 1, 3, 4 allow us to 
deduce that we only need to evaluate the ac- 
tion of the links on the upper half (in spinor 
space) of T'^ipx^fj,, as the lower components 
are equal to the upper components multi- 
plied by ±1 or ±i. In doing so we can halve 
the number of floating point multiplications 
needed in the evaluation of D^, and also re- 
duce intermediate memory usage. This trick 
can be applied in any of the standard repre- 
sentations for the Euclidean-space 7 matri- 
ces. 

III. GENERALISED SPIN-PROJECT- 
ION TRICK 

Wc now consider the case where there arc 
two sets of links, Ufj_{x) for the naive Dirac op- 
erator y and U'^{x) for the irrelevant Wilson 
term (denoted by A' to indicate that it con- 
tains only the links U'). In the case of a FLIC 
action the irrelevant links are APE-smeared, 



but what follows is perfectly general and does 
not depend upon any particular relationship 
between U and U' . Now our "split-link" op- 
erator is 

(ApiitV')x = ((y+ ^A' + m)V'), 
= (4 + m)V^^ - ^ ^ {%{x) - jf,U^{x))iP^+f, 

+ {U'i{x)+^,Ul{x))^|J,.,. (5) 

We can observe that our projectors do not 
present themselves immediately as they did 
before. At this point, compared to the stan- 
dard Wilson action, we must perform four 
times as many floating point multiplications, 
two for the split links, and two for the loss of 
the spin projectors. However, we have 

i = r+ + r;and7M = r+-r;, (6) 

which implies 



(ApiitV')x = (4 + m)iP, - - ^ (ulix) - u^{x))r+ip,+^ + (ulix) + C/^(x))r;V'.+M+ 

{U'l(^) + Ul{x))T;^|;,., + {U'l{x) - t/t(x))r;V^._^. (7) 
It is now clear that by defining symmetrised and anti-symmetrised links, 

U^ix) = \{Ulix) + U,{x)) and C/;(x) = ^([/^(a;) - U,{x)) (8) 

we can write 

(i^spiitV^). = (4+m)v^,-^ t/;(x)rJV',+^+[/;(a:)r;V',+^+t/;t(a;)rJ^,_^+t/;t(a;)r;V',_^. 



Immediately we see that the Wilson spin pro- 
jection trick is simply a special case of the 
split link trick where U = U' . The same sav- 
ing in multiplications that we received in the 
Wilson case applies here, so we have in prin- 



ciple a factor of two compared to the Wilson 

action because U~ is not zero. In actuality, 
efficient cache usage will reduce this to less 
than a factor of two. 



IV. THE FLIC FERMION ACTION 

The FLIC action is a split-link action with clover term0, 

Z^fli, = y+i(A'-^a-F)+m, (10) 

where 

(^^.u = ]pt,.lul F^^(x) = i(C^^(a;) -Cj^(x)), (11) 

C^,{x) = ^ (U^uix) + t/_,^(x) + U,-^{x) + U^^-u{x)) . (12) 

APE-smearing0, ||, |10| is carried out on the individual links in the irrelevant operators 
by making the replacement 

M^)^Ul''\^)=^[ic^-W,^ix) + ^ Yl U,ix)U^ix + aeM{x + ae^)] ■ (13) 



Here V denotes projection of the RHS of Eq. 
(p!3|) back to the SU(3) gauge group. That 
is, each link is modified by replacing it with 
a combination of itself and the surrounding 
staples to give a set of "fat links". The 
means by which one projects back to SU(3) 
is not unique. We choose an SU(3) matrix 
U^\x) such that the gauge invariant mea- 
sure IieTr{U^\x)Xj^{x)) is maximal, where 
X^(x) is the smeared link before projection, 
that is ulf\x) = VXn{x). As the pro- 
cess of APE-smearing removes short-distance 
physics, it is preferable to only smear the ir- 
relevant operators. 

Here a is the smearing fraction and nape is 
the number of smearing sweeps (|l^) we per- 
form. Finally, as in we can perform tad- 
pole or mean-field improvement (MFI) 
to bring our links closer to unity. This con- 
sists of updating each link with a division by 
the mean link, which is the fourth root of the 
average plaquette. 



uo = (-ReTrf/^,(x)),V<.- (14) 

For completeness, we review a (well- 
known) similar trick for the clover term that 



exploits the structure of 0"^,^. In the evalua- 
tion of the clover term, we note that in the 
chiral representation of 7 matrices. 



74 



1 

1 
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1 

-1 



(15) 



the matrix satisfies the following (in 2 x 2 
block notation). 



0"12 



0"34 



and (Ti4 



X 



,0-13 



cr24 



-X 



0"23 



X 



(16) 



So we have, for example. 



-^12^^12 1 ^ 1 +-^34Cr34 



(F12 - ^34)0-12^/' 

(F12 + Fs4)ai2X 
(17) 

This means that if we store the combina- 
tions F12 ± F34, Fi3 ± F24, Fi4 ± F23 we can 
halve the number of floating point multipli- 
cations needed in the evaluation of the clover 
term, further improving the computational 
efficiency of the FLIC action. 
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V. CONCLUSION 

We have presented a generalised version 
of the spin-projection trick which is apphca- 
ble to any spht-hnk action. This allows us 
to halve the number of floating-point multi- 
plications the the evaluation of the action of 
the links upon the fermion field. We have 
also recalled some symmetries of (jp^p in the 
chiral 7 matrix representation which allow us 
to perform a similar cost reduction in the 
evaluation of the clover term. The results 
presented here reduce the cost of evaluating 
the FLIC action to about twice that of the 
standard Wilson action. The exact difference 
will vary depending upon the base architec- 
ture, but on our architecture we have verified 
that the cost of FLIC is almost exactly twice 
that of the Wilson, including the cost of the 
clover term. Additionally, the formulation of 
the split link action in (|^) allows groups who 



have efficient code for the Wilson action to 
simply implement efficient code for the FLIC 
action. Given the benefits of the FLIC ac- 
tion we hope that this work encourages 
groups to consider using the FLIC action for 
their calculations. 
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